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Abstract 

This paper presents a new mathematical formalism that describes 
the quantization of games. The study of so-called quantum games is 
quite new, arising from a seminal paper of D. Meyer [12] published in 
Physics Review Letters in 1999. The ensuing near decade has seen an 
explosion of contributions and controversy over what exactly a quan- 
tized game really is and if there is indeed anything new for game theory. 
What has clouded many of the issues is the lack of a mathematical for- 
malism for the subject in which these various issues can be clearly and 
precisely expressed, and which provides a context in which to present 
their resolution. Such a formalism is presented here, along with pro- 
posed resolutions to some of the issues discussed in the literature. One 
in particular, the question of whether there can exist equilibria in a 
quantized version of a game that do not correspond to classical cor- 
related equilibria of that game and also deliver better payoffs than 
the classical correlated equilibria is answered in the affirmative for the 
Prisoner's Dilemma and Simplified Poker. 

1 Classical Background 

As with all mathematical formalisms, it's best to begin with a definition. 

Definition 1.1. Given a set {1,2, • • • ,n} of players, for each player a set 
Si (i = 1, • • • , n) of so-called pure strategies, and a set Qi (i = 1, ■ ■ ■ ,n) of 
possible outcomes, a game G is a vector-valued function whose domain is 
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the Cartesian product of the Si 's and whose range is the Cartesian product 
of the Qi 's. In symbols 

n n 

i=i i=i 

The function G is sometimes referred to as the payoff function. 

Here a play of the game is a choice by each player of a particular strategy 
Si the collection of which forms a strategy profile (si,-- - ,s n ) whose cor- 
responding outcome profile is G(s±, ■ ■ ■ , s n ) = (u>i, • • • , w n ), where the Wj's 
represent each player's individual outcome. Note that by assigning a real 
valued utility to each player which quantifies that player's preferences over 
the various outcomes, we can without loss of generality, assume that the 
fVs are an copies of R, the field of real numbers. 

In game theory one is frequently concerned with the identification of 
special strategies or strategic profiles. For example, most players would love 
to identify a strategy that guarantees a maximal utility. As this is not usually 
possible, a security strategy, that is, a strategic choice that guarantees an 
explicit lower bound to the utility received, is also sought. But for a fixed 
(n — l)-tuple of opponents' strategies, rational players seek a best reply, that 
is a strategy s* G Si that delivers a utility at least as great, if not greater, 
than any other strategy Si £ Si. That is 

GO, ■■■,*, s*, *,•■■,*) > G(*, s^, *,■••, *) Vsj E 

A Afas/i equilibrium (NE) for G is a strategy profile (si, S2, s n ) such 
that each Sj is a best reply to the (n— l)-tuple of opponents' strategies. Other 
ways of expressing this concept include the observation that no player can 
increase his or her payoffs by unilaterally deviating from his or her equi- 
librium strategy, or that at equilibrium a player's opponents are indifferent 
to that player's strategic choice. As an example, consider the Prisoner's 
Dilemma, a two player game where each player has exactly two strategies 
(a so-called 2 x 2 or bimatrix game) whose payoff function is indicated by 
the tableau below 





h 


t 2 


Sl 


(3,3) 


(0,5) 


S2 


(5,0) 


(1,1) 



Table 1: Prisoner's Dilemma 
Here, note that for player 1 the pure strategy s 2 always delivers a higher 



2 



outcome than the strategy s\ (say S2 strongly dominates s±) and for player 
2 the strategy ti strongly dominates t\. Hence the pair (52^2) is a (unique) 
Nash Equilibrium. However, games need not have equilibria amongst the 
pure strategy profiles as exemplified by the 2x2 game of Simplified Poker 
whose payoff function is given by 





tl 


*2 


Sl 


(5/4,-5/4) 


(0,0) 


S2 


(0,0) 


(5/2,-5/2) 



Table 2: Simplified Poker 



Classical game theoretic formalism now calls upon the theorist to extend 
the game G by enlarging the domain and extending the payoff function. Of 
course, the question of if and how a given function extends is a time honored 
problem in mathematics and the careful application of the mathematics of 
extension is what will give our formalism of quantization its power. Return- 
ing to classical game theory, a standard extension at this point is to consider 
for each player the set of mixed strategies, that is, the set of probability dis- 
tributions over Si. For a given set X, denote the probability distributions 
over X by A(X) and note that when X is finite, with k elements say, the set 
A(X) is just the k — 1 dimensional simplex A^k — 1) over X, i.e., the set of 
real convex linear combinations of elements of X. Of course, we can embed 
X into A(X) by considering the element x as mapped to the probability 
distribution which assigns 1 to x and to everything else. For a given game 
G, denote this embedding of Si into A(Si) by e,. 

Now our game G can be extended to a new, larger game G mix , as follows. 
Given a profile (pi,...,p n ) of probability distributions over the Si's, by 
taking the product distribution we obtain a probability distribution over 
the product FJ S, . Taking the push out by G of this probability distribution 
we obtain a probability distribution over the image of G. By following this 
by the expectation operator we obtain the expected outcome of the mixed 
strategy profile (pi, . . . ,p n ). Assigning the expected outcome to each mixed 
strategy profile we obtain the extended game 

G mx :l[A(S l )^l[n i 

Note G m%x is a true extension of G as G mix o Ha = G; that is we have 
the following commutative diagram 
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II? = iA(Si) Product > A(n? =1 gi) pushout > A(ImG) 




Figure 1: Extension of G by G mix 

Nash's famous theorem [2] says that if Si are all finite, then there always 
exists an equilibrium in G mix . Unfortunately, this equilibrium is called a 
mixed strategy equilibrium for G, when it is not an equilibrium of G at all, 
the abusive terminology confusing G with its image, ImG. Indeed, this is 
where much of the confusion in quantum game theory begins. 

But before proceeding onto quantization, it is useful to place other clas- 
sical game theoretical ideas such as classical mediated communication and 
Aumann's notion of a correlated equilibrium into this context. One begins 
by observing that the function from Y\7=i A (Si) — > A(ImG) is not neces- 
sarily onto. As an example consider any 2x2 game G. If player 1 plays 
his first pure strategy with probability p, say, and player 2 plays her second 
pure strategy with probability q, say, the resulting probability distribution 
over the outcomes of G is given by the tableau below: 





h 


*2 


Sl 


p{l - q) 


pq 


S2 


(l-p)(l-q) 


(l-p)q 



An easy exercise now shows that the element of A(ImG) represented by 





h 


*2 


Sl 


1/2 





S2 





1/2 



is not realizable by any choice of p and q. Classical mediated communication 
addresses this issue. Suppose during pre-play negotiation the players are able 
to hire a referee for negligible cost. For a given p £ A(ImG) the referee is 
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meant to enforce p as follows. The referee secretly observes a random event 
with probability distribution p, thus determining an outcome of G. The 
referee then communicates to each player only his or her strategic choice 
which yields the observed outcome. 

Note that the players are no longer playing the game G, but in fact a 
much larger game G c ° m which is easily described for 2x2 games and whose 
generalization to games with larger strategic spaces should be clear from our 
description. Suppose the strategic space for each player is represented by 
the pair S = {A, B}. The strategic spaces for G c ° m can be represented by 
the quadruple T = {A', B', C', D'} where the strategy C' represents a player 
always cooperating with the referee, D' represents the strategy where the 
player always deviates from the referee's instruction (i.e. playing B when 
he hears A and vice- versa) , A' represents cooperating with the referee when 
A is recommended and deviating otherwise, and B' represents cooperating 
with the referee when B is recommended and deviating otherwise. 

Two other things to note here. First, if both players choose to play C' , 
then the outcome of the new game is exactly the expected outcome under 
p. Second, G c ° m extends the original game G as there are embeddings 
fi : {A, B} -» {A',B',C',D'} taking A to A' and B to B' such that G = 



G"; 



n?=i /i) as m the diagram below 




Qcom 



Figure 2: Extension of G by G c ° m 

Hence, classical mediated communication gives a family, indexed by A(ImG), 
of extensions of G. 

Following Aumann [2], a correlated equilibrium for G occurs whenever 
(C',C) is a Nash equilibrium in G c ° m . That is, the players' agreement to 
follow the referee is self policing, meaning that there is no gain to a player 
from unilateral deviating from the referee's recommendations. Note again 
the abusive terminology, the strategic choice for a correlated equilibrium is 
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not a strategic choice for G at all, but rather a strategic choice outside the 
embedded strategies for G in a larger game. Of course, the use of correlated 
equilibrium may or may not improve the lot of the players. A classic example 
of correlated equilibrium improving the players' lot is given by the variant 
of the 2x2 game of Chicken given below 





h 


t 2 


si 


(2,2) 


(0,3) 


S2 


(3,0) 


(-1,-1) 



Table 3: Chicken 



An easy exercise shows that (s2,^i) an d (si,^) are both pure strategy 
equilibria and there is a unique mixed strategy equilibrium where every 
player plays each of his or her pure strategies with equal probability. This 
mixed strategy equilibrium pays out 1 to each player. It is also easy to 
see that even without a referee any real convex linear combination of these 
three outcomes forms a self policing agreement between the players. For 
example, the players could jointly observe a fair coin and agree to play the 
(si,<2) if it falls Heads and (s2,t±) if it falls Tails. Note that the expected 
outcome of this agreement is (|, |) which is better than the outcome (1, 1) 
from the mixed strategy equilibrium. But even better and outside this 
region is the correlated equilibrium arising from the probability distribution 
|(2, 2) + |(0, 3) + |(3, 0) yielding the outcome (§, §). 

An example where mediated communication does not improve the lot of 
the players is given by Prisoner's Dilemma. One easily checks that due to the 
strong domination present in each player's strategy set, players always have 
an incentive to deviate from the referee's instruction if p assigns a non-zero 
probability to any outcome other than the Nash equilibrium (s2,i2)- This 
domination is so strong that not even mixed strategy equilibrium that assign 
non-zero probability to an outcome other than the Nash equilibrium (s2, £2) 
exists. A similar phenomenon occurs in the zero-sum game of Simplified 
Poker where any deviations from the equilibrium strategies where player I 
chooses his first strategy | of the time and player II chooses her second 
strategy | of the time is fully exploitable by the other player and hence an 
incentive to deviate from any other potential correlated equilibrium strategy. 
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2 A Formalism for Quantization 



Up until now, most authors have, like Meyer, focused their efforts on the 
quantization of the players' strategy spaces, essentially the domain of the 
payoff function that defines the game to be quantized (see for example 
El QTJ 12]). The principal technique is to identify these spaces with 
an orthogonal basis of some quantum system, in order that players may 
now take superpositions or even mixed superpositions of strategic choices 
by acting on the system via quantum operations. In addition, players may 
now correlate their strategic choices via the entanglement of the joint states 
of the system. Frequently, mere access to the higher randomization of su- 
perposition (as opposed to real probabilistic combination) or the correlation 
of strategic choices via entanglement allows payoffs to the players superior 
to those available in the game and its classical extensions. 

As a result, over the last decade, much discussion has occurred in the 
literature as to whether or not various protocols for achieving these happy 
results are indeed "quantum" or contain something new for game theory 
[U [13] . These discussion have focused on the situations when not all players 
have access to quantized strategy spaces, as in Meyer's original penny flip 
game [12J, classical methods for realizing the correlation given by entangle- 
ment exist as for the MW and EWL protocols [13] or as in the Duetchse- 
Josza algorithm [6j, and the games based on the conditional probability 
distributions found in Bell's theorem as introduced by Landsburg and Dahl 
[5] for classical duopoly, and by Iqbal and Cheon for Prisoner's Dilemma, 
Chicken, Stag Hunt, and Battle of the Sexes [9]. 

The main assertion in [3J is that all of these controversies resolve via 
the use of a mathematical formalism for game quantization that focuses on 
the quantization of the payouts of the original game G to be quantized, and 
expresses the quantized version as a (proper) extension of the original pay- 
out function in the set-theoretic sense, just as in the classical case where 
exhaustive studies are made of the extensions of classical games obtained 
from the probabilistic combinations of payoffs. As mentioned above, these 
combinations arise by the players mixing their strategic choices or by com- 
municating their strategic choices to a referee. The protocol also allows for 
fresh quantum game theoretic interpretations of a number of quantum al- 
gorithms and for a broad swath of operations in quantum logic synthesis 

Bi ma. 

Classically, we constructed the probability distributions over the out- 
comes of a game G. We now wish to pass to a more general notion of 
randomization, that of quantum superposition. Begin then with a Hilbert 
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space H that is a complex vector space equipped with an inner product. For 
the purpose here assume that 7i is finite dimensional, and that we have a 
finite set X which is in one-to-one correspondence with an orthogonal basis 
B of H. 

By a quantum superposition of X with respect to the basis B we mean 
a complex projective linear combination of elements of X; that is, a repre- 
sentative of an equivalence class of complex linear combinations where the 
equivalence between combinations is given by non-zero scalar multiplication. 
Quantum mechanics calls this scalar a phase. When the context is clear as to 
the basis to which the set X is identified, denote the set of quantum super- 
positions for X as QS(X). Of course, it is also possible to define quantum 
superpositions for infinite sets, but for the purpose here, one need not be so 
general. What follows can be easily generalized to the infinite case. 

As the underlying space of complex linear combinations is a Hilbert 
space, we can assign a length to each linear combination and, up to phase, 
always represent a projective linear combination by a complex linear com- 
bination of length 1. This process is called normalization and is frequently 
useful. 

For each quantum superposition of X we can obtain a probability dis- 
tribution over X by assigning to each component the ratio of the square of 
the length of its coefficient to the square of the length of the combination. 
For example, the probability distribution produced from ax + 0y is just 

H 2 x l/?l 2 

\a\ 2 + \(5\ 2X \a\ 2 + \(3\ 2V 

Call this function QS(X) — > A(X) a quantum measurement with respect to 
X, and note that geometrically quantum measurement is defined by pro- 
jecting a normalized quantum superposition onto the various elements of 
the normalized basis B. Denote this function by g™ eas , or if the set X is 
clear from the context, by q meas . 

Now given a finite n-player game G, suppose we have a collection Qi, . . . , Q 
of non-empty sets and a protocol, that is, a function Q : f\ Qi — ► QS(ImG). 
Quantum measurement q^Q then gives a probability distribution over ImG. 
Just as in the mixed strategy case we can then form a new game G® by ap- 
plying the expectation operator. Call the game G s thus defined to be the 
quantization of G by the protocol Q. Call the Q^s sets of pure quantum 
strategies for G®. Moreover, if there exist embeddings e\ : Si — > Qi such 
that G 2 ° EI e i = G, call G 2 a proper quantization of G. If there exist 
embeddings e'( : &{Si) -» Qi such that G Q o ]1< = G mix , call G Q a com- 
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plete quantization of G. These definitions are summed up in the following 
commutative diagram. Note for proper quantizations the original game is 
obtained by restricting the quantization to the image of \\e[. F° r general 
extensions, the Game Theory literature refers to this as "recovering" the 
game G. 



QS{ImG) 




Figure 3: A Quantization Formalism 

Note that the definitions of G mix and G® show that a complete quan- 
tization is proper. Furthermore, note that finding a mathematically proper 
quantization of a game G is now just a typical problem of extending a func- 
tion. It is also worth noting here that nothing prohibits us from having 
a quantized game G® play the role of G in the classical situation and by 
considering the probability distributions over the Qi, creating a yet larger 
game G m ^, the mixed quantization of G with respect to the protocol Q. For 
a proper quantization of G, G m ® is an even larger extension of G. The game 
QmQ j g described in the commutative diagram of Figure HI 

Note that the quantum strategy sets Qi need not consist of quantum 
superpositions, although in many quantization protocols they do, see for 
example [3 [11]. Indeed, protocols with classical inputs yielding quantum 
superpositions of the outcomes of certain games have already been posited 
[5j[9]. These and some other specific protocols are discussed in the context 
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Figure 4: Extension of G by G mQ 

of the formalism above in [3] . 

As discussed in [3] and in part the following sections, the literature gives 
several protocols for quantizing one, two, and occasionally even multi-player 
games, some improper, some proper but not complete, and some yielding 
complete quantizations. Yet there is an ongoing debate in the literature as 
to what is the 'correct' method of quantizing a game. The above formalism 
suggests that this is just the wrong question to ask, as under this formal- 
ism a given game can admit several different quantizations. It also makes 
clear that comparisons between various quantizations, between quantiza- 
tions and various classical extensions, and between quantizations and the 
original game itself often amounts to comparing "apples" to "oranges". A 
specific example appears next. 

3 Communication via the EWL Protocol 

In classical mediated communication, players have a referee mediate their 
game and the communication of their strategic choices. For simplicity, as- 
sume our players have but two classical pure strategies to choose from. The 
communication of each players strategic choices is implemented by the send- 
ing of bits to the players, put into an initial state by the referee. Presumably 
players then send back their individual bits in the other state (Flipped) or 
in the original state (Un-Flipped) to indicate the choice of their second or 
first classical pure strategy respectively. The bits are then examined by the 
referee who then makes the appropriate payoffs. 

When the communication between the referee and the players is over 
quantum channels, Eisert, Wilkens and Lewenstein [7] have proposed fami- 
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lies of quantization protocols that depend on the initial joint state prepared 
by the referee. Players and the referee communicate via qubits, a two pure 
state quantum system with a fixed observational basis. In the EWL pro- 
tocol the referee determines the outcomes via a new observational basis 
corresponding to the actions of (No Flip, No Flip), (No Flip, Flip), (Flip, 
No Flip), (Flip, Flip) by the players. Players may choose from any physical 
operation (i.e. the Lie group 5(2)) as pure quantum strategies (the Q^s in 
the formalism above) or even probabilistic combinations thereof (the AQ^s 
in the formalism) for their strategic choices. The procedure above describes 
for each initial state / a protocol Qi and a quantized and mixed quantized 
game G^ 1 and G" 1 ^ 7 per the formalism. 

If the initial state prepared by the referee is given in the Dirac notation 
by |0) <8> |0), then the EWL protocol is a complete quantization and is in fact 
equivalent to the classical game Q m%x . But when the initial state is given 
by the maximally entangled state |0) <8> |0) + |1) <S> |1), the EWL protocol 
remains complete and sets up an onto map from the product of the strategy 
spaces to A (ImG). 

It is here that a fundamental question arises; that is, are Nash equilibria 
in such a quantized games truly new? That is, is the probability distribution 
that arises from an equilibrium pair in the quantized version of game G dif- 
ferent from that arising from a classical correlated equilibrium for G? The 
maximally entangled EWL quantization described above admits a mixed 
quantum strategy equilibrium where each player uses the uniform probabil- 
ity distribution over his or her choice of pure quantum strategy [IT]. The 
resulting probability distribution over the payoffs of G is now again the uni- 
form distribution, assigning an equal probability to each of the four outcomes 
of G. Note that for the Prisoner's Dilemma, this distribution does not arise 
from a classical correlated equilibrium as it assigns a non-zero probability 
to each of the classical non-equilibrium payoffs, and so does not correspond 
to any classical correlated equilium for this game, yet delivering a payoff to 
the players superior to that of the classical pure strategy equilibrium. An 
even more remarkable result holds true for the maximally entangled EWL 
quantization of the zero-sum game of Simplified Poker, where the uniformly 
mixed quantum equilibrium out performs the classical mixed strategy equi- 
librium payoff for player I, yet is still a security strategy for player I against 
which player II has no recourse. 



11 



4 Conclusion 



The discussion above shows that for the Prisoners' Dilemma and Simpli- 
fied Poker quantization does indeed hold something new for Game Theory. 
Several other controversies can be resolved and new game theoretic interpre- 
tations of certain problems from quantum computation and quantum logic 
synthesis can be illuminated via the quantization formalism described here, 
see [3 H [TO] for details. 
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